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BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 
10 The present invention relates generally to systems and methods for video 

communication. More specifically, the present invention relates to a webcam-based 
interface for initiating two-way video communication. 

DESCRIPTION OF RELATED BACKGROUND ART 
15 In recent years, a growing number of personal computers and interactive 

television systems are equipped with digital video cameras. Such cameras may be 
used for two-way video communication (videoconferencing) between systems 
connected by a communication network, such as a local area network (LAN) or the 
Internet. 

20 Digital video cameras may also be used for one-way video communication. 

For example, a camera may be configured as a "web camera" or "webcam." A 
webcam captures a continual stream of video images and broadcasts the images via 
the Internet to any requesting Web browser. In the early days of the Internet, 
webcams were used to monitor coffee pots and fish tanks. Today, thousands of 

25 webcams are used in such diverse applications as monitoring freeway traffic (e.g., 



"traffic cams") and monitoring children at day care centers (e.g., "kiddie cams" or 
"cradle cams 11 ). 

Current videoconferencing systems are deficient in a number of respects. For 
example, before a video connection is established, a user of one system cannot 
5 visually determine whether a user of another system is within the proximity of the 
camera and thus able to accept the video communication. Thus, many wasted 
communication attempts are made when a receiving party is not available. 

Additionally, situations exist in which a user of one system may wish to 
monitor the output of a camera of another system before deciding whether to initiate 
10 video communication. For example, parents may wish to periodically monitor the 
activities of their children and only initiate communication when the necessity arises. 
Similarly, a child caring for an aging parent may wish to periodically monitor the 
parent's condition using a digital video camera, changing the one-way video 
monitoring into two-way video communication if their assistance is required. 

15 Indeed, it would be desirable for an individual to be able to monitor the output 

of multiple cameras at different locations, e.g., day care, parent's home, etc., while 
being able to selectively establish two-way video communication with one of the 
locations. Current videoconferencing systems do not provide such a monitoring 
feature prior to the establishment of a video communication channel. 

20 Using a videoconferencing system to create a permanent video 

communication channel between two systems may be undesirable for a number of 
reasons, including cost, privacy, and security. For example, parents monitoring their 
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children from work may not wish their children to have a permanent window into their 
offices. Additionally, creating a two-way video communication channel is expensive 
in terms of communication bandwidth, limiting a videoconferencing system to only a 
few simultaneous connections. 

5 Webcams provide a mechanism for one-way video monitoring of multiple 

video streams. Unfortunately, the one-way nature of webcams make them unsuited 
for video communication. No hybrid webcam-monitoring and videoconferencing 
system currently exists. 

Accordingly, what is needed is a system and method for allowing a user of 
10 one system to visually determine whether a user of another system is available 
before attempting to establish two-way video communication between the systems. 
What is also needed is a system and method that allows the user of the first system 
to periodically monitor the output of a camera of the second system, while allowing a 
user to selectively establish a two-way video communication channel with the 
15 second system if desired. Indeed, what is needed is a system and method for 

converting a one-way video monitoring session into a two-way video communication 
session in response to a user selection of a displayed webcam. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 Non-exhaustive embodiments of the invention are described with reference to 

the figures, in which: 
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FIG. 1 is a block diagram of a system supporting one-way and two-way video 
communication; 

FIG. 2 is an illustration of an interactive television system including a remote 
control, a camera, a set top box, and a television; 

5 FIG. 3 is a block diagram of physical components of a set top box; 

FIG. 4 is a block diagram of logical components of a system for providing a 
webcam-based interface for initiating two-way video communication; 

FIG. 5 is a block diagram of an system for video stream compositing 
implemented within a cable head-end; 

10 FIG. 6 illustrates a plurality of video streams displayed in a grid format; 

FIG. 7 illustrates a video communication window; 

FIG. 8 illustrates a video communication window displayed with a plurality of 
video streams; and 

FIG. 9 illustrates a plurality of video streams displayed in a ticker format. 

15 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention solves the foregoing problems and disadvantages with 
a webcam-based interface for initiating two-way video communication. In one 
20 embodiment, a terminal receives and simultaneously displays a number of video 
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streams. Each video stream may be generated by a camera associated with a 
different terminal in a network. 

The video streams may be displayed on the terminal in various formats, such 
as a grid format or a ticker format. For instance, in the grid format, the video 
5 streams are arranged in rows and columns on the terminal's display screen. In the 
ticker format, the video streams move across the display screen in a carousel 
fashion (e.g., like a stock ticker). Any number of video streams may be displayed in 
either format, limited only by bandwidth considerations. 

Preferably, a user may select one of the displayed video streams. In 
10 response to the user selection, a two-way video communication may be established 
between the terminal on which the video streams are displayed and a terminal from 
which the selected video stream originated. 

Accordingly, a user of one system may visually determine whether a user of 
another system is available before attempting to establish two-way video 
15 communication between the systems. Additionally, a user of the first system may 
periodically monitor the output of a camera of the second system and selectively 
establish two-way video communication with the second system if desired. This is of 
particular utility to parents and other caregivers. 

In one embodiment, the displayed video streams are cached within the 
20 receiving terminal such that the streams may be viewed at a subsequent time. 
Alternatively, the video streams may be cached within an intermediate network 
node, such as a cable head-end, or at the originating terminal. Thus, in one 
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embodiment, a user may select one of the displayed video streams and request 
playback of an earlier-in-time segment of the video stream. 

Reference throughout this specification to "one embodiment" or "an 
embodiment" means that a particular feature, structure, or characteristic described in 
5 connection with the embodiment is included in at least one embodiment of the 

present invention. Thus, appearances of the phrases "in one embodiment" or "in an 
embodiment" in various places throughout this specification are not necessarily all 
referring to the same embodiment. 

Furthermore, the described features, structures, or characteristics may be 
10 combined in any suitable manner in one or more embodiments. In the following 
description, numerous specific details are provided, such as examples of 
programming, user selections, network transactions, database queries, database 
structures, etc., to provide a thorough understanding of embodiments of the 
invention. One skilled in the relevant art will recognize, however, that the invention 
15 can be practiced without one or more of the specific details, or with other methods, 
components, materials, etc. In other instances, well-known structures, materials, or 
operations are not shown or described in detail to avoid obscuring aspects of the 
invention. 

Throughout the following disclosure, the term "coupled" may be used to refer 
20 to components that are either directly connected to one another or that are linked by 
one or more other components. Thus, as used herein, the term "coupled" may be 
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synonymous with "in electrical communication with" or simply "in communication 
with." 

Referring now to FIG. 1, there is shown system 100 supporting both one-way 
(e.g., webcam) and two-way video communication. In one implementation, the 
5 system 100 relies on a broadband cable network 102. However, other networks are 
contemplated, examples of which include a satellite network (not shown), a 
telephone network 104, or a packet-driven network, such as the Internet 106. 

In one configuration, the system 100 includes a plurality of set top boxes 
(STBs) 108 located, for instance, at customer homes. Generally, an STB 108 is a 
10 consumer electronics device that serves as a gateway between a customer's 

television 110 and the network 102. In alternative embodiments, an STB 108 may 
be embodied more generally as a personal computer 1 12, an advanced television 
110 with integrated STB functionality, or another type of client terminal. 

An STB 108 receives encoded video signals and other information from the 
15 network 1 02 and decodes the same for display on the television 1 1 0 or other display 
device (such as a computer monitor, flat panel display, or the like). As its name 
implies, an STB 108 is typically located on top of, or in close proximity to, a 
television 1 1 0. 

An STB 108 may be coupled to a video camera 114, which captures video 
20 information for the STB 1 08. The video camera 1 14 may be embodied as digital 
video camera having a charge-coupled device (CCD) array. Digital video cameras 
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are well known in the art, such as the Logitech QuickCam Pro USB\ available from 
Logitech, Inc. of Fremont, California. 

The video camera 1 14 may be used for both one-way and two-way video 
communication. For example, the video camera 1 14 may be configured as a 
5 "webcam", which captures a continual stream of video images and broadcasts the 
images via a communication network, such as the Internet 106, to a requesting client 
module, such as a Web browser. However, the term "webcam" should not be 
construed as being limited only to the World Wide Web component of the Internet. 

In certain embodiments, the video camera 1 14 may include an integrated 
10 microphone (not shown) for capturing audio signals. Alternatively, the microphone 
may be integrated with the STB 108, the remote control 118, the television 1 10, or 
the like. 

The following discussion refers primarily to the capture and transmission of 
video signals. However, those skilled in the art recognize that the term 
15 "videoconferencing" also implies the capture and transmission of audio signals. 
Thus, where video signals and video components are specifically illustrated, audio 
signals and audio components are implied. 

As discussed in further detail below, the STB 108 may convert video signals 
generated by the video camera 114 into various formats for transmission through the 
20 network 1 02. The STB 1 08 may transmit video streams to one or more other STBs 
108, to a personal computer 112, or to another type of terminal. 
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In one embodiment, each STB 108 is coupled to the network 102 via a head- 
end 1 1 6 or other intermediate network node. In the context of a cable network, a 
head-end 1 16 is a centrally-located facility where television programs are received 
from a local cable TV satellite downlink or other source and packaged together for 
5 transmission to customer homes. In one configuration, a head-end 1 16 also 

functions as a Central Office (CO) in the telephone industry, routing video streams 
and other data to and from the various STBs 108 serviced thereby. 

Head-ends 1 1 6 may be coupled directly to one another or through the 
network 102. In some cases, head-ends 116 may be connected via a separate 
10 network, one particular example of which is the Internet 1 06. Two or more STBs 
108 may also be served by the same head-end 116. 

Transmission between head-ends 116 may occur, for example, (i) via a direct 
p eer _t 0 _peer connection between head-ends 116, (ii) upstream from a first head-end 
1 16 to broadband communication network 102 and then downstream to a second 
15 head-end 116, (iii) via the Internet 106, or (iv) via a telephone network 104. For 
example, a first STB 108 may send a video transmission upstream to a first head- 
end 116, then to a second head-end 116, and finally downstream to a second STB 
108. 

In one configuration, video streams are distributed in an encoded format, 
20 such as MPEG (Moving Picture Experts Group) or Video over IP (VoIP). Various 
MPEG standards are known, such as MPEG-2, MPEG-4, MPEG-7, and the like. 
Thus, the term "MPEG," as used herein, contemplates all MPEG standards. 
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Moreover, other video encoding/compression standards exist other than MPEG, 
such as JPEG, JPEG-LS, H.261 , and H.263. Accordingly, the invention should not 
be construed as being limited only to MPEG. Of course, proprietary video encoding 
formats may be used within the scope of the invention. 

5 Each STB 108 may be distinguished from other network components by a 

unique identifier, number, code, or address, examples of which include an IP 
(Internet Protocol) address or a MAC (Media Access Control) address. Thus, video 
streams and other information may be transmitted from the network 102 to a specific 
STB 108 by specifying the corresponding address, after which the network 102 

10 routes the transmission to its destination using conventional techniques. 

The network 102 is also preferably coupled to the Internet 106 to provide 
access thereto by the STBs 108. The Internet 106 is a "network of networks" and is 
well known to those skilled in the art. Communication over the Internet 106 is 
accomplished using standard protocols, such as TCP/IP (transmission control 
15 protocol/Internet protocol) and the like. 

A remote control 1 18 is provided, in one configuration, for convenient remote 
operation of the STB 108 and the television 110. The remote control 118 may use 
infrared (IR), radio frequency (RF), or other wireless technologies to transmit control 
signals to the STB 108 and the television 110. Other remote control devices are 
20 also contemplated, such as a wired or wireless mouse (not shown), a keyboard (not 
shown), or the like. 
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FIG. 2 is an illustration of an interactive television system 200, including a 
remote control 1 18, a camera 1 14, a set top box 108, and a television 1 10 (or other 
display device). As noted, the remote control 1 18 is provided for convenient remote 
operation of the STB 108 and television 110. Preferably, control signals are 
5 transmitted from a wireless transmitter 202 in the remote control 1 1 8 to a wireless 
receiver 204 in the STB 108 and/or the television 1 1 0. 

In the depicted embodiment, the remote control 118 includes a plurality of 
buttons or similar controls. For instance, the remote control 118 may include a 
power button 206, an up arrow button 208, a down arrow button 210, a left arrow 
10 button 212, a right arrow button 214, a "Select" button 216, an "OK" button 218, 
channel adjustment buttons 220, volume adjustment buttons 222, alphanumeric 
buttons 224, a "Call" button 226, and an "Cache" button 228. The functions of 
certain of the above-identified buttons will be discussed in greater detail below. 

While the following description refers primarily to a broadband cable network 
15 102, the invention is not limited in this respect. A satellite delivery system may also 
be used, such as direct broadcast satellite (DBS) system. A DBS system may 
include a small 18-inch satellite dish (which is an antenna for receiving a satellite 
broadcast signal); a digital integrated receiver/decoder (IRD), which separates each 
channel and decompresses and translates the digital signal for display by a 
20 television; and a remote control. 

Programming for a DBS system may be distributed, for example, by multiple 
high-power satellites in geosynchronous orbit, each with multiple transponders. 
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Compression (e.g., MPEG) is used to increase the amount of programming that can 
be transmitted in the available bandwidth. 

A digital broadcast center may be used to gather programming content, 
ensure its digital quality, and transmit the signal up to the satellites. Programming 

5 may come to the broadcast center from content providers (TBS, HBO, CNN, ESPN, 
etc.) via satellite, fiber optic cable and/or special digital tape. Satellite-delivered 
programming is typically immediately digitized, encrypted and uplinked to the 
orbiting satellites. The satellites retransmit the signal to every earth-station - or, in 
other words, every compatible DBS system receiver dish at customers' homes and 

10 businesses. 

Some programs may be recorded on digital videotape in the broadcast center 
to be broadcast later. Before any recorded programs are viewed by customers, 
technicians may use post-production equipment to view and analyze each tape to 
ensure audio and video quality. Tapes may then be loaded into a robotic tape 

15 handling systems, and playback may be triggered by a computerized signal sent 
from a broadcast automation system. Back-up videotape playback equipment may 
ensure uninterrupted transmission at all times. 

While the following description makes particular reference to cable head-ends 
1 16, it should be recognized that satellite broadcast centers may be used for the 

20 same purpose. Thus, as used herein, the term "broadcast center" may refer 
interchangeably to head-ends 1 16 or satellite broadcast centers. 

FIG. 3 is a block diagram of physical components a set top box 108 according 
to an embodiment of the invention. The STB 108 may include, in one configuration, 
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a network interface 302 for communicating with the broadband communication 
network 102 via the head-end 116. The interface 302 may include conventional 
tuning circuitry for receiving MPEG (Moving Picture Experts Group) packets for a 
selected television channel. The interface 302 may also include conventional cable 
5 modem circuitry for sending or receiving other types of data. For example, the 
interface 302 may conform to the DOCSIS (Data Over Cable Service Interface 
Specification) or DAVIC (Digital Audio-Visual Council) cable modem standards. 

In one configuration, one or more frequency bands (for example, from 5 to 30 

f-i MHz) may be reserved for upstream transmission. Digital modulation (for example, 

=13 

m 10 quadrature amplitude modulation or vestigial sideband modulation) may be used to 

. 

fl| send digital signals in an upstream transmission. 

UH 

=3 Of course, upstream transmission may be accomplished differently for 

O different networks. Alternative ways to accomplish upstream transmission include 

ST? 

using back channel transmission, which is typically sent via an analog telephone 
y[ 15 line, ISDN, DSL, or other techniques. For example, in one embodiment, 

communication with a telephone network 104 may also be accomplished via a 
separate telephone modem 304, such as a DSL (digital subscriber line) or analog 
telephone modem. 

The STB 108 also preferably includes a CODEC (encoder/decoder) 306. The 
20 CODEC 306 serves to encode signals (such as audio/video signals) into a network- 
compatible data stream for transmission over the network 102. The CODEC 306 
also serves to decode a network-compatible data stream received from the network 
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102. As depicted, the CODEC 306 may be implemented as a hardware component. 
Alternatively, or in addition, software encoding and decoding may be used. The 
CODEC 306 may use various algorithms, such as MPEG, for encoding and 
decoding. The CODEC 306 may be directly or indirectly coupled to the video 
5 camera 1 14 in order to encode a captured video signal. 

The STB 108 may further include a memory device 308, such as a random 
access memory (RAM), to store data for temporary use. Similarly, a read-only 
memory (ROM) may be provided for storing more permanent data, such as fixed 
code and configuration information. 

10 In one implementation, an audio/video (AN) controller 310 is provided for 

converting decoded digital audio/video information into analog or digital signals for 
display/playback on the television 1 10 or other display devices. The AN controller 
310 may be implemented using one or more physical devices, such as separate 
graphics and audio interfaces. Preferably, the AN controller 310 includes graphics 

15 hardware for performing bit-block transfers (bit-blits) and other graphical operations. 

The STB 108 may also include a storage device 312, such as a hard disk 
drive, or the like. The storage device 312 may record encoded television broadcasts 
and retrieve the broadcasts at a later time for decoding by the CODEC 306 and 
display by the AN controller 310. Thus, the storage device 312 may be used in PVR 
20 (personal video recording) applications, such as time shifting, pausing (buffering) live 
video, etc. 
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The storage device 312 may also be used, in various embodiments, to store 
viewer preferences, parental lock settings, electronic programming guide (EPG) 
data, programming preferences, passwords, e-mail messages, and the like. In one 
implementation, the storage device 312 also stores an operating system (OS) for the 
5 STB 108, such as Windows CE® or Linux®. 

As previously noted, the STB 108 may include a wireless receiver 204 for 
receiving control signals from the wireless transmitter 202 of the remote control 118. 
The wireless transmitter 202 may rely on various wireless technologies, such as IR 
or RF. 

10 A CPU 314 controls the operation of the STB 108, including the other 

components thereof, which are coupled to the CPU 314 via a bus 316. The bus 316 
serves a communication channel between many components (e.g., CPU 314, 
Interface 302, storage device 312) of the STB 108. The CPU 314 may be embodied 
as a microprocessor, a microcontroller, a digital signal processor (DSP) or other 

15 device known in the art. For example, the CPU 314 may be embodied as an Intel® 
x86 microprocessor. The CPU 314 may perform various logical and arithmetic 
operations in response to control signals generated by the remote control 118. 

Of course, FIG. 3 illustrates only one possible configuration of an STB 108. 
Those skilled in the art will recognize that various other architectures and 
20 components may be provided within the scope of the invention. In addition, various 
standard components of typical STB 108 are not illustrated in order to avoid 
obscuring aspects of the invention. 
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FIG. 4 is a block diagram of logical components of system 400 for providing a 
webcam-based interface for initiating two-way video communication. The depicted 
logical components may be implemented using one or more of the physical 
components shown in FIG. 3. Additionally, or in the alternative, various logical 
5 components may be implemented as software modules stored in the memory 308 
and/or storage device 312 and executed by the CPU 314 . Those skilled in the art 
will recognize that various illustrated components may be combined together or 
integrated with standard components in various configurations without departing 
from the scope or spirit of the invention. 

10 As shown in FIG. 4, a plurality of video streams 401 are generated by video 

cameras 1 14 coupled to terminals within a network. The terminals may be 
embodied as STBs 108, personal computers 112, or the like. As depicted, the 
network may be embodied as a cable network, which includes a plurality of head- 
ends 116. Of course, other terminals, networks, and network architectures may be 

15 used within the scope of the invention. 

In one embodiment, the STB 108 includes a stream reception component 
402, which receives the plurality of video streams 401 from a head-end 1 16 or other 
intermediate network node. As noted above, the video cameras 1 14 may be 
configured as webcams, which include hardware and software for providing 
20 continuous, one-way video streams 401 to any requesting client module, such as a 
Web browser. In such an embodiment, the stream reception component 402 may 
be embodied as the component of a Web browser that is responsible for receiving 
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and processing video streams 401 received from webcams. However, the stream 
reception component 402 need not be implemented in the context of a Web browser. 

The particular video streams 401 received by the stream reception module 
402 are preferably user selected. For example, the user may specify a network 
5 address of each camera-equipped terminal for which a video stream 401 is desired. 
The stream reception component 402 then interacts with the selected terminal to 
establish a one-way video communication channel. Various standard protocols may 
be used, well known to those skilled in the art. 

In certain embodiments, the selection of video streams 401 may be dictated 
10 by a user's video phonebook 404, which may list the network addresses of various 
terminals with which the user regularly establishes video communication. A user 
interface (not shown) may be provided for allowing a user to select terminals from 
the videophone book 404 from which to receive video streams 401 . Of course, 
security protocols may be provided to limit a user's access to terminals and video 
15 cameras 1 14 from which the user is authorized to receive video streams 401 . 

In one embodiment, the STB 108 also includes a stream display component 
406, which simultaneously displays the video streams 401 (or a subset thereof) on a 
display device, such as a television 110. As explained in greater detail below, the 
stream display component 406 may arrange the video streams 401 in different 
20 formats, such as a grid format or a ticker format. The stream display component 406 
may be implemented, in certain embodiments, as the component of a Web browser 
responsible for displaying video streams 401 received from multiple webcams. 
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In one configuration, the video streams 401 are "live", e.g., not recorded and 
subsequently broadcast at a later time. However, live video streams 401 may be 
received by the stream reception component 402 after a delay due to network 
latency and other factors. Thus, the term "live" should not be confused with "real 
5 time". 

In certain embodiments, the video streams 401 are not live, but include pre- 
recorded information. For example, when a video camera 1 14 for a terminal is not 
active, a pre-recorded video stream 401 may be transmitted. The pre-recorded 
video stream 401 may include moving or static images. Alternatively, an STB 108 
w may store pre-recorded video streams or images to display in the event that a video 
camera 1 14 is not active. For example, the STB 108 may store a picture of a user 
associated with a particular webcam, which is displayed when the user's webcam is 
not transmitted live video. 

The STB 108 may further include a stream selection component 408 which 
15 allows the user to select one of the displayed video streams 401 . As described in 
more detail below, the stream selection component 408 may allow a user to select a 
video stream 401 by moving a selection outline (not shown) around a desired stream 
and pressing an appropriate button on the remote control 118, such as the "Call" 
button 226 or the "Cache" button 228 (illustrated in FIG. 2). 

20 The STB 108 may further include a video communication component 410 in 

communication with the stream selection component 408. In one embodiment, 
when a user selects a video stream 401 and presses the "Call" button 226 (or other 
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suitable control), the video communication component 410 establishes two-way 
video communication between the user's terminal (e.g., STB 108) and the terminal 
from which the selected video stream 401 originated. Systems for two-way video 
communication using video cameras 114 are known, such as Microsoft 
5 Netmeeting®, CUseeMe®, or the like. As illustrated in FIG. 1 , the two-way video 
communication may be facilitated by various networks, such as such as broadband 
communication network 102, a telephone network 104, or the Internet 106. 

Providing a video communication component 410 in communication with the 
stream reception and selection components 402, 408 allows a one-way (e.g., 

10 webcam) video communication channel to be converted into a two-way video 
communication channel. This is advantageous in that a user of the STB 108 may 
visually determine whether a user of another system is available before attempting 
to establish two-way video communication. Additionally, the user of the STB 1 08 
may periodically monitor the camera output of the other system and selectively 

15 establish two-way video communication with that system if desired. 

In certain embodiments, the STB 108 further includes a stream caching 
component 412. The stream caching component 412 may cache the plurality of 
video streams 401 (or selected ones) for a period of time. The video streams 401 
may be cached, for example, within the storage device 312 of the STB 108. In one 
20 implementation, the stream caching component 412 caches the plurality of video 
streams for a pre-determined time period, which may be hardwired or user selected. 
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Alternatively, the caching period may be limited by the storage space available 
within the storage device 312. 

In one embodiment, after a user selects a video stream 401 and presses the 
"Cache" button 228 (or other suitable control), the stream caching component 412 
5 retrieves a cached, earlier-in-time segment of the selected video stream 401 and 
provides the same to the stream display component 406 for display. For example, 
the stream caching component 412 may retrieve a cached copy of the last five 
minutes of a selected video stream 401 , which is then displayed on the television 
1 10 by the stream display 406 component. 

10 The cached segment may be displayed by the stream display component 406 

in forward or reverse direction. For example, playback of the cached segment may 
commence at the end of the cached segment (which may correspond to the time at 
which the "Cache" button 228 was pressed) and proceed in reverse direction to the 
start of the cached segment. Alternatively, playback may begin at the start of the 

15 cached segment and proceed to the end thereof. 

In certain embodiments, the video streams 401 are not cached locally within 
the receiving STB 108. Rather, the video streams 401 are cached within another 
computer, possibly within the head-end 1 16, or another suitable location. 
Centralized caching of video streams 401 may be advantageous where a plurality of 
20 users are receiving the same video streams 401 . Thus, separate copies of the video 
steams 401 need not be stored within each STB 108. In other embodiments, the 
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video streams 401 are cached within the terminal from which the video streams 401 
originated. 

In such embodiments, a cache interface 414 is provided to communicate with 
the head-end 116 or other remote caching location to retrieve the cached video 
5 stream 401 . The cache interface 414 may also be used to send video streams 401 
captured by the local camera 114 associated with an STB 108. 

In the above-described embodiments, several individual video streams 401 
are received by the STB 108. In an alternative embodiment, as shown in FIG. 5, a 
stream reception component 502 within a head-end 1 16 or other intermediate 
10 network node may receive the individual video streams 401 . Thereafter, a stream 
compositing component 504 combines the multiple video streams 401 into a single, 
composite video stream 505. A stream transmission component 506 then transmits 
the composite video stream 505 to the STB 108, where it is displayed as described 
above. 

15 From the user's perspective, the display of the composite video stream 505 

may be identical to the embodiment in which individual video streams 401 are 
received by the STB 108. For example, an individual video "stream" may still be 
selected within the composite video stream 505 in order to establish two-way video 
communication with a corresponding remote terminal or to access a cached video 

20 segment. However, from the perspective of the STB 108, receiving a single 

composite stream 505 generally reduces the processing requirements of the CPU 
314. Thus, the STB 108 may be manufactured with lower cost components. 
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As previously noted, the head-end 116 may include a stream caching 
component 412 for centrally caching video streams 401 for a plurality of users. 
Segments of the cached video streams 401 may then be retrieved by individual 
STBs 108 for display. 

FIG. 6 illustrates a plurality of video streams 401a-401d being displayed on a 
television 110. As noted, the video streams 401a-401d may be displayed in various 
configurations, such as a grid configuration, in which the individual video streams 
401a-401d are arranged in rows and columns. Of course, a single row or column 
are within the scope of the invention. 

Preferably, the video streams 401a-401d are selectable. In one embodiment, 
a user may select, for example, video stream 401a by moving a selection outline 602 
around the video stream 401a. In alternative embodiments, a pointer (not shown) or 
other selection mechanism could be used. The selection outline 602 is moved, in 
one embodiment, by means of navigation buttons on the remote control 118, such 
as the up, down, left, and right buttons 208, 210, 212, 214. In other embodiments, a 
mouse, trackball, or other pointing device could be used for the same purpose. 

In response to the user pressing the "Call" or "Cache" buttons 226, 228, the 
plurality of video streams 401 may be replaced, in one embodiment, by a single, 
enlarged video communication window 702, as shown in FIG. 7. The video 
communication window 702 may be used for two-way video communication (in 
conjunction with the video communication module 410) or for displaying cached 
video stream segments (by means of with the stream caching module 412). 
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As shown in FIG. 8, the video communication window 702 need not replace 
all of the displayed video streams 401 . This allows the user to continue to monitor 
other video streams 401 while engaging in two-way video communication or while 
watching cached video stream segments. In certain embodiments, none of the 
5 video streams 401 are replaced by the video communication window 702, but are 
simply reduced in size proportionately to fill the area not used by the video 
communication window 702. 

In an alternative embodiment, as shown in FIG. 9, a plurality of video streams 
401a-401d may be displayed in a ticker format. In one configuration, the displayed 
10 video streams 401 a-401d move across the television 1 1 0 from right to left in a 

carousel fashion. For example, when the video stream 401 d passes beyond the left 
hand side of the television 110, it reappears on the right hand side. 

Of course, the ticker format may be configured in various different ways 
without departing from the spirit and scope of the invention. For example, the video 
15 streams 401 may move in any direction. Moreover, more than one row or column of 
video streams 401 may move across the screen simultaneously. 

As in the case of the grid format, a user may select a video stream by 
controlling a selection outline 602, pointer, or other selection mechanism. Once a 
selection has been confirmed (e.g., using the "Call" button 226 or the "Cache" button 
20 228), a video communication window 702 (not shown) may be displayed, as 
previously described. 
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Based on the foregoing, the present invention offers numerous advantages 
over conventional systems. In particular, a user may monitor video streams 401 
from multiple cameras 1 14 at a single location. Thereafter, by selecting one of the 
video streams 401 , a user may establish two-way video communication with the 
terminal associated with the selected video stream 401 . Alternatively, the user may 
retrieve and display cached video footage from an earlier-in-time segment of the 
selected video stream 401 . 

While specific embodiments and applications of the present invention have 
been illustrated and described, it is to be understood that the invention is not limited 
to the precise configuration and components disclosed herein. Various 
modifications, changes, and variations which will be apparent to those skilled in the 
art may be made in the arrangement, operation, and details of the methods and 
systems of the present invention disclosed herein without departing from the spirit 
and scope of the invention. 

What is claimed is: 
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